Overview
Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 899164 |
| Missing cells | 748090 |
| Missing cells (%) | 3.6% |
| Duplicate rows | 141 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 157.8 MiB |
| Average record size in memory | 184.0 B |
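The derived figures in this table follow from the raw counts. The sketch below is a quick sanity check. It assumes that the missing-cell percentage is taken over every cell (variables × observations) and that 1 MiB is 1,048,576 bytes; neither is stated in the report.

```python
# Raw figures copied from the dataset statistics table.
n_variables = 23
n_observations = 899_164
missing_cells = 748_090
duplicate_rows = 141
total_size_mib = 157.8

# Assumption: "Missing cells (%)" is relative to every cell in the table.
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
print(f"Duplicate rows: {duplicate_pct:.3f}%")
```

Under these assumptions the computed values agree with the reported 3.6%, 184.0 B, and < 0.1%.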
Variable types
| Text | 6 |
|---|---|
| Numeric | 7 |
| DateTime | 3 |
| Categorical | 7 |
| Dataset has 141 (< 0.1%) duplicate rows | Duplicates |
MIS_Status is highly overall correlated with TARGET | High correlation |
TARGET is highly overall correlated with MIS_Status | High correlation |
RevLineCr is highly imbalanced (61.3%) | Imbalance |
LowDoc is highly imbalanced (80.6%) | Imbalance |
BalanceGross is highly imbalanced (> 99.9%) | Imbalance |
ChgOffDate has 736465 (81.9%) missing values | Missing |
NoEmp is highly skewed (γ1 = 80.24824355) | Skewed |
CreateJob is highly skewed (γ1 = 36.99135473) | Skewed |
RetainedJob is highly skewed (γ1 = 36.85481184) | Skewed |
NAICS has 201948 (22.5%) zeros | Zeros |
CreateJob has 629248 (70.0%) zeros | Zeros |
RetainedJob has 440403 (49.0%) zeros | Zeros |
FranchiseCode has 208835 (23.2%) zeros | Zeros |
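The imbalance percentages in the alerts above are consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below rests on that assumption and is not a quote of the library's code; `imbalance_score` is a hypothetical helper. It uses the LowDoc counts reported in that variable's Common Values table, with missing values excluded.

```python
import math

# Counts for LowDoc, copied from its "Common Values" table (missing values excluded).
lowdoc_counts = {"N": 782_822, "Y": 110_335, "0": 1_491, "C": 758,
                 "S": 603, "A": 497, "R": 75, "1": 1}

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits and
    k is the number of observed classes (assumed definition, hypothetical helper)."""
    counts = [c for c in counts if c > 0]
    k = len(counts)
    if k < 2:
        raise ValueError("the score needs at least two observed classes")
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts)
    return 1 - entropy / math.log2(k)

score = imbalance_score(lowdoc_counts.values())
print(f"LowDoc imbalance: {score:.1%}")
```

With these eight classes the score comes out at 80.6%, matching the alert. A score near 100% means a single class dominates; a score of 0% means all classes are equally frequent.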
Reproduction
| Analysis started | 2025-02-11 08:48:10.221764 |
|---|---|
| Analysis finished | 2025-02-11 08:49:22.531646 |
| Duration | 1 minute and 12.31 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
Variables
State
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 14 |
| Missing (%) | < 0.1% |
| Memory size | 6.9 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IN |
|---|---|
| 2nd row | IN |
| 3rd row | IN |
| 4th row | OK |
| 5th row | FL |
| Value | Count | Frequency (%) |
| ca | 130619 | 14.5% |
| tx | 70458 | 7.8% |
| ny | 57693 | 6.4% |
| fl | 41212 | 4.6% |
| pa | 35170 | 3.9% |
| oh | 32622 | 3.6% |
| il | 29669 | 3.3% |
| ma | 25272 | 2.8% |
| mn | 24373 | 2.7% |
| nj | 24035 | 2.7% |
| Other values (41) | 428027 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 306176 | |
| C | 184957 | |
| N | 181727 | |
| M | 132549 | 7.4% |
| T | 125069 | 7.0% |
| I | 119518 | 6.6% |
| O | 94906 | 5.3% |
| L | 88819 | 4.9% |
| X | 70458 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1798300 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 306176 | |
| C | 184957 | |
| N | 181727 | |
| M | 132549 | 7.4% |
| T | 125069 | 7.0% |
| I | 119518 | 6.6% |
| O | 94906 | 5.3% |
| L | 88819 | 4.9% |
| X | 70458 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1798300 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 306176 | |
| C | 184957 | |
| N | 181727 | |
| M | 132549 | 7.4% |
| T | 125069 | 7.0% |
| I | 119518 | 6.6% |
| O | 94906 | 5.3% |
| L | 88819 | 4.9% |
| X | 70458 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1798300 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 306176 | |
| C | 184957 | |
| N | 181727 | |
| M | 132549 | 7.4% |
| T | 125069 | 7.0% |
| I | 119518 | 6.6% |
| O | 94906 | 5.3% |
| L | 88819 | 4.9% |
| X | 70458 | 3.9% |
| Y | 68255 | 3.8% |
| Other values (14) | 425866 |
Zip
Real number (ℝ)
| Distinct | 33611 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53804.391 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 283 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3838 |
| Q1 | 27587 |
| median | 55410 |
| Q3 | 83704 |
| 95-th percentile | 95822 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 56117 |
Descriptive statistics
| Standard deviation | 31184.159 |
|---|---|
| Coefficient of variation (CV) | 0.5795839 |
| Kurtosis | -1.3359893 |
| Mean | 53804.391 |
| Median Absolute Deviation (MAD) | 28206 |
| Skewness | -0.16816663 |
| Sum | 4.8378972 × 10¹⁰ |
| Variance | 9.7245178 × 10⁸ |
| Monotonicity | Not monotonic |
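The descriptive statistics above are related to one another, which gives a simple way to check the orders of magnitude of the sum and variance. The sketch below copies the Zip figures from this report and assumes the usual definitions (coefficient of variation = standard deviation / mean, variance = standard deviation squared, sum = mean × number of values).

```python
# Values copied from the Zip statistics tables in this report.
n = 899_164                  # observations; Zip has no missing values
mean = 53804.391
std = 31184.159
reported = {"cv": 0.5795839, "variance": 9.7245178e8, "sum": 4.8378972e10}

derived = {
    "cv": std / mean,        # coefficient of variation
    "variance": std ** 2,    # squared standard deviation
    "sum": mean * n,         # mean times the number of values
}

for name, value in derived.items():
    rel_diff = abs(value - reported[name]) / reported[name]
    print(f"{name}: derived={value:.7g} reported={reported[name]:.7g} rel_diff={rel_diff:.1e}")
```

All three derived values agree with the reported ones to within rounding of the displayed digits.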
| Value | Count | Frequency (%) |
| 10001 | 933 | 0.1% |
| 90015 | 926 | 0.1% |
| 93401 | 806 | 0.1% |
| 90010 | 733 | 0.1% |
| 33166 | 671 | 0.1% |
| 90021 | 666 | 0.1% |
| 59601 | 640 | 0.1% |
| 65804 | 599 | 0.1% |
| 3801 | 581 | 0.1% |
| 59101 | 578 | 0.1% |
| Other values (33601) | 892031 |
| Value | Count | Frequency (%) |
| 0 | 283 | |
| 1 | 24 | < 0.1% |
| 2 | 11 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 15 | < 0.1% |
| 9 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 209 | |
| 99950 | 3 | < 0.1% |
| 99929 | 15 | < 0.1% |
| 99928 | 1 | < 0.1% |
| 99926 | 1 | < 0.1% |
| 99925 | 4 | < 0.1% |
| 99923 | 1 | < 0.1% |
| 99921 | 13 | < 0.1% |
| 99919 | 2 | < 0.1% |
| 99918 | 1 | < 0.1% |
NAICS
Real number (ℝ)
Zeros 
| Distinct | 1312 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 398660.95 |
| Minimum | 0 |
|---|---|
| Maximum | 928120 |
| Zeros | 201948 |
| Zeros (%) | 22.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 235210 |
| median | 445310 |
| Q3 | 561730 |
| 95-th percentile | 811192 |
| Maximum | 928120 |
| Range | 928120 |
| Interquartile range (IQR) | 326520 |
Descriptive statistics
| Standard deviation | 263318.31 |
|---|---|
| Coefficient of variation (CV) | 0.66050691 |
| Kurtosis | -1.0476526 |
| Mean | 398660.95 |
| Median Absolute Deviation (MAD) | 176300 |
| Skewness | -0.26287834 |
| Sum | 3.5846157 × 10¹¹ |
| Variance | 6.9336534 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 201948 | 22.5% |
| 722110 | 27989 | 3.1% |
| 722211 | 19448 | 2.2% |
| 811111 | 14585 | 1.6% |
| 621210 | 14048 | 1.6% |
| 624410 | 10111 | 1.1% |
| 812112 | 9230 | 1.0% |
| 561730 | 8935 | 1.0% |
| 621310 | 8733 | 1.0% |
| 812320 | 7894 | 0.9% |
| Other values (1302) | 576243 |
| Value | Count | Frequency (%) |
| 0 | 201948 | |
| 111110 | 32 | < 0.1% |
| 111120 | 3 | < 0.1% |
| 111130 | 1 | < 0.1% |
| 111140 | 94 | < 0.1% |
| 111150 | 49 | < 0.1% |
| 111160 | 2 | < 0.1% |
| 111191 | 3 | < 0.1% |
| 111199 | 7 | < 0.1% |
| 111211 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 928120 | 32 | |
| 928110 | 4 | < 0.1% |
| 927110 | 1 | < 0.1% |
| 926150 | 10 | < 0.1% |
| 926140 | 6 | < 0.1% |
| 926130 | 3 | < 0.1% |
| 926120 | 5 | < 0.1% |
| 926110 | 6 | < 0.1% |
| 925120 | 1 | < 0.1% |
| 925110 | 3 | < 0.1% |
ApprovalDate
Date
| Distinct | 9859 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
| Minimum | 1975-01-20 00:00:00 |
|---|---|
| Maximum | 2074-12-17 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
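The reported range runs from 1975-01-20 to 2074-12-17, while the analysis itself ran in February 2025. A span that starts about 50 years before the analysis date and extends about 50 years past it is what one would expect if the source column stores two-digit years and the parser resolved them with a sliding window around the current year. This is an assumption; the report does not show the raw values. Under that assumption, and the further assumption that no approval can postdate the analysis, a minimal correction shifts future dates back one century. `shift_future_dates` is a hypothetical helper.

```python
from datetime import date

# Assumption: no loan can be approved after the analysis date in this report.
ANALYSIS_DATE = date(2025, 2, 11)

def shift_future_dates(d, cutoff=ANALYSIS_DATE):
    """Move a date that lies after `cutoff` back by 100 years (hypothetical helper)."""
    if d > cutoff:
        # For years 2001 to 2099 a leap year minus 100 is also a leap year,
        # so 29 February survives the shift.
        return d.replace(year=d.year - 100)
    return d

print(shift_future_dates(date(2074, 12, 17)))  # the reported maximum
print(shift_future_dates(date(1997, 2, 28)))   # a past date is left unchanged
```

The same check would apply to DisbursementDate, whose reported maximum also lies in 2074.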
ApprovalFY
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.00002 |
| Min length | 4 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1997 |
|---|---|
| 2nd row | 1997 |
| 3rd row | 1997 |
| 4th row | 1997 |
| 5th row | 1997 |
| Value | Count | Frequency (%) |
| 2005 | 77525 | 8.6% |
| 2006 | 76040 | 8.5% |
| 2007 | 71876 | 8.0% |
| 2004 | 68290 | 7.6% |
| 2003 | 58193 | 6.5% |
| 1995 | 45758 | 5.1% |
| 2002 | 44391 | 4.9% |
| 1996 | 40112 | 4.5% |
| 2008 | 39540 | 4.4% |
| 1997 | 37748 | 4.2% |
| Other values (42) | 339691 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1167176 | |
| 9 | 704676 | |
| 2 | 639911 | |
| 1 | 435726 | 12.1% |
| 5 | 125258 | 3.5% |
| 6 | 118366 | 3.3% |
| 7 | 112975 | 3.1% |
| 8 | 104656 | 2.9% |
| 4 | 102220 | 2.8% |
| 3 | 85692 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3596656 | |
| Uppercase Letter | 18 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1167176 | |
| 9 | 704676 | |
| 2 | 639911 | |
| 1 | 435726 | 12.1% |
| 5 | 125258 | 3.5% |
| 6 | 118366 | 3.3% |
| 7 | 112975 | 3.1% |
| 8 | 104656 | 2.9% |
| 4 | 102220 | 2.8% |
| 3 | 85692 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3596656 | |
| Latin | 18 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1167176 | |
| 9 | 704676 | |
| 2 | 639911 | |
| 1 | 435726 | 12.1% |
| 5 | 125258 | 3.5% |
| 6 | 118366 | 3.3% |
| 7 | 112975 | 3.1% |
| 8 | 104656 | 2.9% |
| 4 | 102220 | 2.8% |
| 3 | 85692 | 2.4% |
Latin
| Value | Count | Frequency (%) |
| A | 18 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3596674 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1167176 | |
| 9 | 704676 | |
| 2 | 639911 | |
| 1 | 435726 | 12.1% |
| 5 | 125258 | 3.5% |
| 6 | 118366 | 3.3% |
| 7 | 112975 | 3.1% |
| 8 | 104656 | 2.9% |
| 4 | 102220 | 2.8% |
| 3 | 85692 | 2.4% |
Term
Real number (ℝ)
| Distinct | 412 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.77308 |
| Minimum | 0 |
|---|---|
| Maximum | 569 |
| Zeros | 810 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 60 |
| median | 84 |
| Q3 | 120 |
| 95-th percentile | 300 |
| Maximum | 569 |
| Range | 569 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 78.857305 |
|---|---|
| Coefficient of variation (CV) | 0.7118815 |
| Kurtosis | 0.18570424 |
| Mean | 110.77308 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | 1.1209258 |
| Sum | 99603164 |
| Variance | 6218.4746 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 84 | 230162 | |
| 60 | 89945 | 10.0% |
| 240 | 85982 | 9.6% |
| 120 | 77654 | 8.6% |
| 300 | 44727 | 5.0% |
| 180 | 28164 | 3.1% |
| 36 | 19800 | 2.2% |
| 12 | 17095 | 1.9% |
| 48 | 15621 | 1.7% |
| 72 | 9419 | 1.0% |
| Other values (402) | 280595 |
| Value | Count | Frequency (%) |
| 0 | 810 | 0.1% |
| 1 | 1608 | |
| 2 | 1809 | |
| 3 | 2112 | |
| 4 | 2173 | |
| 5 | 1866 | |
| 6 | 3054 | |
| 7 | 1761 | |
| 8 | 1693 | |
| 9 | 1875 |
| Value | Count | Frequency (%) |
| 569 | 1 | |
| 527 | 1 | |
| 511 | 1 | |
| 505 | 1 | |
| 481 | 1 | |
| 480 | 1 | |
| 461 | 1 | |
| 449 | 1 | |
| 445 | 1 | |
| 443 | 1 |
NoEmp
Real number (ℝ)
Skewed 
| Distinct | 599 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.411353 |
| Minimum | 0 |
|---|---|
| Maximum | 9999 |
| Zeros | 6631 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 10 |
| 95-th percentile | 40 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 74.108196 |
|---|---|
| Coefficient of variation (CV) | 6.4942514 |
| Kurtosis | 7965.2886 |
| Mean | 11.411353 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 80.248244 |
| Sum | 10260678 |
| Variance | 5492.0248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 154254 | |
| 2 | 138297 | |
| 3 | 90674 | |
| 4 | 73644 | 8.2% |
| 5 | 60319 | 6.7% |
| 6 | 45759 | 5.1% |
| 10 | 31536 | 3.5% |
| 7 | 31495 | 3.5% |
| 8 | 31361 | 3.5% |
| 12 | 20822 | 2.3% |
| Other values (589) | 221003 |
| Value | Count | Frequency (%) |
| 0 | 6631 | 0.7% |
| 1 | 154254 | |
| 2 | 138297 | |
| 3 | 90674 | |
| 4 | 73644 | |
| 5 | 60319 | 6.7% |
| 6 | 45759 | 5.1% |
| 7 | 31495 | 3.5% |
| 8 | 31361 | 3.5% |
| 9 | 18131 | 2.0% |
| Value | Count | Frequency (%) |
| 9999 | 4 | |
| 9992 | 1 | < 0.1% |
| 9945 | 1 | < 0.1% |
| 9090 | 1 | < 0.1% |
| 9000 | 2 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8041 | 1 | < 0.1% |
| 8018 | 1 | < 0.1% |
| 8000 | 7 | |
| 7999 | 1 | < 0.1% |
NewExist
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 136 |
| Missing (%) | < 0.1% |
| Memory size | 6.9 MiB |
| 1.0 | 644869 |
|---|---|
| 2.0 | 253125 |
| 0.0 | 1034 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 644869 | |
| 2.0 | 253125 | 28.2% |
| 0.0 | 1034 | 0.1% |
| (Missing) | 136 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 644869 | |
| 2.0 | 253125 | 28.2% |
| 0.0 | 1034 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 900062 | |
| . | 899028 | |
| 1 | 644869 | |
| 2 | 253125 | 9.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1798056 | |
| Other Punctuation | 899028 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 900062 | |
| 1 | 644869 | |
| 2 | 253125 | 14.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 899028 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2697084 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 900062 | |
| . | 899028 | |
| 1 | 644869 | |
| 2 | 253125 | 9.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2697084 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 900062 | |
| . | 899028 | |
| 1 | 644869 | |
| 2 | 253125 | 9.4% |
CreateJob
Real number (ℝ)
Skewed  Zeros 
| Distinct | 246 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.4303764 |
| Minimum | 0 |
|---|---|
| Maximum | 8800 |
| Zeros | 629248 |
| Zeros (%) | 70.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 10 |
| Maximum | 8800 |
| Range | 8800 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 236.68817 |
|---|---|
| Coefficient of variation (CV) | 28.075634 |
| Kurtosis | 1369.911 |
| Mean | 8.4303764 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 36.991355 |
| Sum | 7580291 |
| Variance | 56021.288 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 629248 | |
| 1 | 63174 | 7.0% |
| 2 | 57831 | 6.4% |
| 3 | 28806 | 3.2% |
| 4 | 20511 | 2.3% |
| 5 | 18691 | 2.1% |
| 10 | 11602 | 1.3% |
| 6 | 11009 | 1.2% |
| 8 | 7378 | 0.8% |
| 7 | 6374 | 0.7% |
| Other values (236) | 44540 | 5.0% |
| Value | Count | Frequency (%) |
| 0 | 629248 | |
| 1 | 63174 | 7.0% |
| 2 | 57831 | 6.4% |
| 3 | 28806 | 3.2% |
| 4 | 20511 | 2.3% |
| 5 | 18691 | 2.1% |
| 6 | 11009 | 1.2% |
| 7 | 6374 | 0.7% |
| 8 | 7378 | 0.8% |
| 9 | 3330 | 0.4% |
| Value | Count | Frequency (%) |
| 8800 | 648 | |
| 5621 | 1 | < 0.1% |
| 5199 | 1 | < 0.1% |
| 5085 | 1 | < 0.1% |
| 3500 | 1 | < 0.1% |
| 3100 | 1 | < 0.1% |
| 3000 | 4 | < 0.1% |
| 2515 | 1 | < 0.1% |
| 2140 | 1 | < 0.1% |
| 2020 | 1 | < 0.1% |
RetainedJob
Real number (ℝ)
Skewed  Zeros 
| Distinct | 358 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.797257 |
| Minimum | 0 |
|---|---|
| Maximum | 9500 |
| Zeros | 440403 |
| Zeros (%) | 49.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 20 |
| Maximum | 9500 |
| Range | 9500 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 237.1206 |
|---|---|
| Coefficient of variation (CV) | 21.961188 |
| Kurtosis | 1362.0182 |
| Mean | 10.797257 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 36.854812 |
| Sum | 9708505 |
| Variance | 56226.179 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 440403 | |
| 1 | 88790 | 9.9% |
| 2 | 76851 | 8.5% |
| 3 | 49963 | 5.6% |
| 4 | 39666 | 4.4% |
| 5 | 32627 | 3.6% |
| 6 | 23796 | 2.6% |
| 7 | 16530 | 1.8% |
| 8 | 15698 | 1.7% |
| 10 | 15438 | 1.7% |
| Other values (348) | 99402 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 440403 | |
| 1 | 88790 | 9.9% |
| 2 | 76851 | 8.5% |
| 3 | 49963 | 5.6% |
| 4 | 39666 | 4.4% |
| 5 | 32627 | 3.6% |
| 6 | 23796 | 2.6% |
| 7 | 16530 | 1.8% |
| 8 | 15698 | 1.7% |
| 9 | 8735 | 1.0% |
| Value | Count | Frequency (%) |
| 9500 | 1 | < 0.1% |
| 8800 | 648 | |
| 7250 | 1 | < 0.1% |
| 5000 | 1 | < 0.1% |
| 4441 | 1 | < 0.1% |
| 4000 | 2 | < 0.1% |
| 3900 | 1 | < 0.1% |
| 3860 | 1 | < 0.1% |
| 3225 | 1 | < 0.1% |
| 3200 | 1 | < 0.1% |
FranchiseCode
Real number (ℝ)
Zeros 
| Distinct | 2768 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2753.7259 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 208835 |
| Zeros (%) | 23.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 15805 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 12758.019 |
|---|---|
| Coefficient of variation (CV) | 4.6330025 |
| Kurtosis | 24.409524 |
| Mean | 2753.7259 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.9752152 |
| Sum | 2.4760512 × 10⁹ |
| Variance | 1.6276705 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 638554 | |
| 0 | 208835 | 23.2% |
| 78760 | 3373 | 0.4% |
| 68020 | 1921 | 0.2% |
| 50564 | 1034 | 0.1% |
| 21780 | 1003 | 0.1% |
| 25650 | 715 | 0.1% |
| 79140 | 659 | 0.1% |
| 22470 | 615 | 0.1% |
| 17998 | 606 | 0.1% |
| Other values (2758) | 41849 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 208835 | 23.2% |
| 1 | 638554 | |
| 3 | 12 | < 0.1% |
| 395 | 5 | < 0.1% |
| 399 | 3 | < 0.1% |
| 400 | 2 | < 0.1% |
| 401 | 12 | < 0.1% |
| 404 | 1 | < 0.1% |
| 407 | 34 | < 0.1% |
| 414 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 1 | < 0.1% |
| 92006 | 4 | < 0.1% |
| 92000 | 9 | |
| 91999 | 11 | |
| 91450 | 2 | < 0.1% |
| 91446 | 1 | < 0.1% |
| 91443 | 2 | < 0.1% |
| 91435 | 1 | < 0.1% |
| 91424 | 1 | < 0.1% |
| 91423 | 2 | < 0.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 899164 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 899164 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 899164 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 470654 | |
| 0 | 323167 | |
| 2 | 105343 | 11.7% |
RevLineCr
Categorical
Imbalance 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4528 |
| Missing (%) | 0.5% |
| Memory size | 6.9 MiB |
| N | 420288 |
|---|---|
| 0 | 257602 |
| Y | 201397 |
| T | 15284 |
| 1 | 23 |
| Other values (13) | 42 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | N |
|---|---|
| 2nd row | N |
| 3rd row | N |
| 4th row | N |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 420288 | |
| 0 | 257602 | |
| Y | 201397 | |
| T | 15284 | 1.7% |
| 1 | 23 | < 0.1% |
| R | 14 | < 0.1% |
| ` | 11 | < 0.1% |
| 2 | 6 | < 0.1% |
| C | 2 | < 0.1% |
| , | 1 | < 0.1% |
| Other values (8) | 8 | < 0.1% |
| (Missing) | 4528 | 0.5% |
Length
| Value | Count | Frequency (%) |
| n | 420288 | |
| 0 | 257602 | |
| y | 201397 | |
| t | 15284 | 1.7% |
| 1 | 23 | < 0.1% |
| r | 14 | < 0.1% |
| (empty) | 14 | < 0.1% |
| 2 | 6 | < 0.1% |
| c | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 420288 | |
| 0 | 257602 | |
| Y | 201397 | |
| T | 15284 | 1.7% |
| 1 | 23 | < 0.1% |
| R | 14 | < 0.1% |
| ` | 11 | < 0.1% |
| 2 | 6 | < 0.1% |
| C | 2 | < 0.1% |
| , | 1 | < 0.1% |
| Other values (8) | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 636987 | |
| Decimal Number | 257635 | |
| Modifier Symbol | 11 | < 0.1% |
| Other Punctuation | 2 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 420288 | |
| Y | 201397 | |
| T | 15284 | 2.4% |
| R | 14 | < 0.1% |
| C | 2 | < 0.1% |
| A | 1 | < 0.1% |
| Q | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 257602 | |
| 1 | 23 | < 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| . | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 11 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 636987 | |
| Common | 257649 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 257602 | |
| 1 | 23 | < 0.1% |
| ` | 11 | < 0.1% |
| 2 | 6 | < 0.1% |
| , | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| N | 420288 | |
| Y | 201397 | |
| T | 15284 | 2.4% |
| R | 14 | < 0.1% |
| C | 2 | < 0.1% |
| A | 1 | < 0.1% |
| Q | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 894636 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 420288 | |
| 0 | 257602 | |
| Y | 201397 | |
| T | 15284 | 1.7% |
| 1 | 23 | < 0.1% |
| R | 14 | < 0.1% |
| ` | 11 | < 0.1% |
| 2 | 6 | < 0.1% |
| C | 2 | < 0.1% |
| , | 1 | < 0.1% |
| Other values (8) | 8 | < 0.1% |
LowDoc
Categorical
Imbalance 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2582 |
| Missing (%) | 0.3% |
| Memory size | 6.9 MiB |
| N | 782822 |
|---|---|
| Y | 110335 |
| 0 | 1491 |
| C | 758 |
| S | 603 |
| Other values (3) | 573 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Y |
|---|---|
| 2nd row | Y |
| 3rd row | N |
| 4th row | Y |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 782822 | |
| Y | 110335 | 12.3% |
| 0 | 1491 | 0.2% |
| C | 758 | 0.1% |
| S | 603 | 0.1% |
| A | 497 | 0.1% |
| R | 75 | < 0.1% |
| 1 | 1 | < 0.1% |
| (Missing) | 2582 | 0.3% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| n | 782822 | |
| y | 110335 | 12.3% |
| 0 | 1491 | 0.2% |
| c | 758 | 0.1% |
| s | 603 | 0.1% |
| a | 497 | 0.1% |
| r | 75 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 782822 | |
| Y | 110335 | 12.3% |
| 0 | 1491 | 0.2% |
| C | 758 | 0.1% |
| S | 603 | 0.1% |
| A | 497 | 0.1% |
| R | 75 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 895090 | |
| Decimal Number | 1492 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 782822 | |
| Y | 110335 | 12.3% |
| C | 758 | 0.1% |
| S | 603 | 0.1% |
| A | 497 | 0.1% |
| R | 75 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1491 | |
| 1 | 1 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 895090 | |
| Common | 1492 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 782822 | |
| Y | 110335 | 12.3% |
| C | 758 | 0.1% |
| S | 603 | 0.1% |
| A | 497 | 0.1% |
| R | 75 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 0 | 1491 | |
| 1 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 896582 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 782822 | |
| Y | 110335 | 12.3% |
| 0 | 1491 | 0.2% |
| C | 758 | 0.1% |
| S | 603 | 0.1% |
| A | 497 | 0.1% |
| R | 75 | < 0.1% |
| 1 | 1 | < 0.1% |
ChgOffDate
Date
Missing 
| Distinct | 6448 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 736465 |
| Missing (%) | 81.9% |
| Memory size | 6.9 MiB |
| Minimum | 1988-10-03 00:00:00 |
|---|---|
| Maximum | 2026-10-22 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
DisbursementDate
Date
| Distinct | 8472 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 2368 |
| Missing (%) | 0.3% |
| Memory size | 6.9 MiB |
| Minimum | 1975-01-17 00:00:00 |
|---|---|
| Maximum | 2074-12-04 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
| Distinct | 118859 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 11.537586 |
| Min length | 6 |
Unique
| Unique | 79785 |
|---|---|
| Unique (%) | 8.9% |
Sample
| 1st row | $60,000.00 |
|---|---|
| 2nd row | $40,000.00 |
| 3rd row | $287,000.00 |
| 4th row | $35,000.00 |
| 5th row | $229,000.00 |
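The sample values are currency strings, which is presumably why this variable is profiled as text: each value carries a `$` and thousands separators, and the character counts for this variable show one space per value. A minimal sketch for converting such strings to numbers before re-profiling follows. `parse_currency` is a hypothetical helper and assumes US-style formatting (comma as thousands separator, period as decimal point).

```python
from decimal import Decimal

def parse_currency(text):
    """Convert a string such as '$60,000.00 ' to a Decimal (hypothetical helper)."""
    cleaned = text.strip().replace("$", "").replace(",", "")
    return Decimal(cleaned)

# The five sample rows shown above, with an assumed trailing space.
samples = ["$60,000.00 ", "$40,000.00 ", "$287,000.00 ", "$35,000.00 ", "$229,000.00 "]
amounts = [parse_currency(s) for s in samples]
print(amounts)
```

`Decimal` is used rather than `float` so that cent amounts are represented exactly; a float conversion would also work if exactness is not required.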
| Value | Count | Frequency (%) |
| 50,000.00 | 43787 | 4.9% |
| 100,000.00 | 36714 | 4.1% |
| 25,000.00 | 27387 | 3.0% |
| 150,000.00 | 23373 | 2.6% |
| 10,000.00 | 21328 | 2.4% |
| 35,000.00 | 14748 | 1.6% |
| 5,000.00 | 14193 | 1.6% |
| 75,000.00 | 13528 | 1.5% |
| 20,000.00 | 13462 | 1.5% |
| 30,000.00 | 12696 | 1.4% |
| Other values (118849) | 677948 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4457089 | |
| , | 924978 | 8.9% |
| . | 899164 | 8.7% |
| $ | 899164 | 8.7% |
| (space) | 899164 | 8.7% |
| 5 | 445569 | 4.3% |
| 1 | 409947 | 4.0% |
| 2 | 312909 | 3.0% |
| 3 | 238773 | 2.3% |
| 4 | 207077 | 2.0% |
| Other values (4) | 680348 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6751712 | |
| Other Punctuation | 1824142 | 17.6% |
| Currency Symbol | 899164 | 8.7% |
| Space Separator | 899164 | 8.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4457089 | |
| 5 | 445569 | 6.6% |
| 1 | 409947 | 6.1% |
| 2 | 312909 | 4.6% |
| 3 | 238773 | 3.5% |
| 4 | 207077 | 3.1% |
| 7 | 183883 | 2.7% |
| 6 | 177786 | 2.6% |
| 8 | 162618 | 2.4% |
| 9 | 156061 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 924978 | |
| . | 899164 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 899164 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 899164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10374182 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4457089 | |
| , | 924978 | 8.9% |
| . | 899164 | 8.7% |
| $ | 899164 | 8.7% |
| (space) | 899164 | 8.7% |
| 5 | 445569 | 4.3% |
| 1 | 409947 | 4.0% |
| 2 | 312909 | 3.0% |
| 3 | 238773 | 2.3% |
| 4 | 207077 | 2.0% |
| Other values (4) | 680348 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10374182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4457089 | |
| , | 924978 | 8.9% |
| . | 899164 | 8.7% |
| $ | 899164 | 8.7% |
| (space) | 899164 | 8.7% |
| 5 | 445569 | 4.3% |
| 1 | 409947 | 4.0% |
| 2 | 312909 | 3.0% |
| 3 | 238773 | 2.3% |
| 4 | 207077 | 2.0% |
| Other values (4) | 680348 | 6.6% |
BalanceGross
Categorical
Imbalance 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
| $0.00 | 899150 |
|---|---|
| $12,750.00 | 1 |
| $827,875.00 | 1 |
| $25,000.00 | 1 |
| $37,100.00 | 1 |
| Other values (10) | 10 |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 6.0000767 |
| Min length | 6 |
Unique
| Unique | 14 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | $0.00 |
|---|---|
| 2nd row | $0.00 |
| 3rd row | $0.00 |
| 4th row | $0.00 |
| 5th row | $0.00 |
Common Values
| Value | Count | Frequency (%) |
| $0.00 | 899150 | |
| $12,750.00 | 1 | < 0.1% |
| $827,875.00 | 1 | < 0.1% |
| $25,000.00 | 1 | < 0.1% |
| $37,100.00 | 1 | < 0.1% |
| $43,127.00 | 1 | < 0.1% |
| $84,617.00 | 1 | < 0.1% |
| $1,760.00 | 1 | < 0.1% |
| $115,820.00 | 1 | < 0.1% |
| $996,262.00 | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| 0.00 | 899150 | |
| 12,750.00 | 1 | < 0.1% |
| 827,875.00 | 1 | < 0.1% |
| 25,000.00 | 1 | < 0.1% |
| 37,100.00 | 1 | < 0.1% |
| 43,127.00 | 1 | < 0.1% |
| 84,617.00 | 1 | < 0.1% |
| 1,760.00 | 1 | < 0.1% |
| 115,820.00 | 1 | < 0.1% |
| 996,262.00 | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2697490 | |
| $ | 899164 | 16.7% |
| . | 899164 | 16.7% |
| (space) | 899164 | 16.7% |
| , | 13 | < 0.1% |
| 1 | 11 | < 0.1% |
| 7 | 8 | < 0.1% |
| 2 | 7 | < 0.1% |
| 6 | 7 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (4) | 18 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2697548 | |
| Other Punctuation | 899177 | 16.7% |
| Currency Symbol | 899164 | 16.7% |
| Space Separator | 899164 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2697490 | |
| 1 | 11 | < 0.1% |
| 7 | 8 | < 0.1% |
| 2 | 7 | < 0.1% |
| 6 | 7 | < 0.1% |
| 9 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 8 | 5 | < 0.1% |
| 4 | 4 | < 0.1% |
| 3 | 3 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 899164 | |
| , | 13 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 899164 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 899164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5395053 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2697490 | |
| $ | 899164 | 16.7% |
| . | 899164 | 16.7% |
| (space) | 899164 | 16.7% |
| , | 13 | < 0.1% |
| 1 | 11 | < 0.1% |
| 7 | 8 | < 0.1% |
| 2 | 7 | < 0.1% |
| 6 | 7 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (4) | 18 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5395053 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2697490 | |
| $ | 899164 | 16.7% |
| . | 899164 | 16.7% |
| (space) | 899164 | 16.7% |
| , | 13 | < 0.1% |
| 1 | 11 | < 0.1% |
| 7 | 8 | < 0.1% |
| 2 | 7 | < 0.1% |
| 6 | 7 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (4) | 18 | < 0.1% |
MIS_Status
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1997 |
| Missing (%) | 0.2% |
| Memory size | 6.9 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.1756172 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P I F |
|---|---|
| 2nd row | P I F |
| 3rd row | P I F |
| 4th row | P I F |
| 5th row | P I F |
Common Values
| Value | Count | Frequency (%) |
| P I F | 739609 | 82.3% |
| CHGOFF | 157558 | 17.5% |
| (Missing) | 1997 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| p | 739609 | 31.1% |
| i | 739609 | 31.1% |
| f | 739609 | 31.1% |
| chgoff | 157558 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1479218 | 31.9% |
| F | 1054725 | 22.7% |
| P | 739609 | 15.9% |
| I | 739609 | 15.9% |
| C | 157558 | 3.4% |
| H | 157558 | 3.4% |
| G | 157558 | 3.4% |
| O | 157558 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3164175 | 68.1% |
| Space Separator | 1479218 | 31.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1054725 | 33.3% |
| P | 739609 | 23.4% |
| I | 739609 | 23.4% |
| C | 157558 | 5.0% |
| H | 157558 | 5.0% |
| G | 157558 | 5.0% |
| O | 157558 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1479218 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3164175 | 68.1% |
| Common | 1479218 | 31.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 1054725 | 33.3% |
| P | 739609 | 23.4% |
| I | 739609 | 23.4% |
| C | 157558 | 5.0% |
| H | 157558 | 5.0% |
| G | 157558 | 5.0% |
| O | 157558 | 5.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 1479218 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4643393 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1479218 | 31.9% |
| F | 1054725 | 22.7% |
| P | 739609 | 15.9% |
| I | 739609 | 15.9% |
| C | 157558 | 3.4% |
| H | 157558 | 3.4% |
| G | 157558 | 3.4% |
| O | 157558 | 3.4% |
ChgOffPrinGr
Text
| Distinct | 83165 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.8997235 |
| Min length | 6 |
Unique
| Unique | 52342 |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | $0.00 |
|---|---|
| 2nd row | $0.00 |
| 3rd row | $0.00 |
| 4th row | $0.00 |
| 5th row | $0.00 |
| Value | Count | Frequency (%) |
| 0.00 | 737152 | 82.0% |
| 50,000.00 | 2110 | 0.2% |
| 10,000.00 | 1865 | 0.2% |
| 25,000.00 | 1371 | 0.2% |
| 35,000.00 | 1345 | 0.1% |
| 100,000.00 | 1028 | 0.1% |
| 20,000.00 | 594 | 0.1% |
| 30,000.00 | 492 | 0.1% |
| 15,000.00 | 467 | 0.1% |
| 5,000.00 | 356 | < 0.1% |
| Other values (83155) | 152384 | 16.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2643222 | 42.6% |
| $ | 899164 | 14.5% |
| . | 899164 | 14.5% |
| (space) | 899164 | 14.5% |
| , | 161591 | 2.6% |
| 1 | 98607 | 1.6% |
| 2 | 88727 | 1.4% |
| 4 | 86077 | 1.4% |
| 9 | 81470 | 1.3% |
| 3 | 79226 | 1.3% |
| Other values (4) | 267571 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3344900 | 53.9% |
| Other Punctuation | 1060755 | 17.1% |
| Currency Symbol | 899164 | 14.5% |
| Space Separator | 899164 | 14.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2643222 | 79.0% |
| 1 | 98607 | 2.9% |
| 2 | 88727 | 2.7% |
| 4 | 86077 | 2.6% |
| 9 | 81470 | 2.4% |
| 3 | 79226 | 2.4% |
| 5 | 71099 | 2.1% |
| 8 | 66886 | 2.0% |
| 7 | 65400 | 2.0% |
| 6 | 64186 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 899164 | 84.8% |
| , | 161591 | 15.2% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 899164 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 899164 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6203983 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2643222 | 42.6% |
| $ | 899164 | 14.5% |
| . | 899164 | 14.5% |
| (space) | 899164 | 14.5% |
| , | 161591 | 2.6% |
| 1 | 98607 | 1.6% |
| 2 | 88727 | 1.4% |
| 4 | 86077 | 1.4% |
| 9 | 81470 | 1.3% |
| 3 | 79226 | 1.3% |
| Other values (4) | 267571 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6203983 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2643222 | 42.6% |
| $ | 899164 | 14.5% |
| . | 899164 | 14.5% |
| (space) | 899164 | 14.5% |
| , | 161591 | 2.6% |
| 1 | 98607 | 1.6% |
| 2 | 88727 | 1.4% |
| 4 | 86077 | 1.4% |
| 9 | 81470 | 1.3% |
| 3 | 79226 | 1.3% |
| Other values (4) | 267571 | 4.3% |
GrAppv
Text
| Distinct | 22128 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 11.513319 |
| Min length | 8 |
Unique
| Unique | 13651 |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | $60,000.00 |
|---|---|
| 2nd row | $40,000.00 |
| 3rd row | $287,000.00 |
| 4th row | $35,000.00 |
| 5th row | $229,000.00 |
| Value | Count | Frequency (%) |
| 50,000.00 | 69394 | 7.7% |
| 25,000.00 | 51258 | 5.7% |
| 100,000.00 | 50977 | 5.7% |
| 10,000.00 | 38366 | 4.3% |
| 150,000.00 | 27624 | 3.1% |
| 20,000.00 | 23434 | 2.6% |
| 35,000.00 | 23181 | 2.6% |
| 30,000.00 | 21004 | 2.3% |
| 5,000.00 | 19146 | 2.1% |
| 15,000.00 | 18472 | 2.1% |
| Other values (22118) | 556308 | 61.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4946152 | 47.8% |
| , | 925342 | 8.9% |
| . | 899164 | 8.7% |
| $ | 899164 | 8.7% |
| (space) | 899164 | 8.7% |
| 5 | 450225 | 4.3% |
| 1 | 345271 | 3.3% |
| 2 | 266534 | 2.6% |
| 3 | 180629 | 1.7% |
| 4 | 133995 | 1.3% |
| Other values (4) | 406722 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6729528 | 65.0% |
| Other Punctuation | 1824506 | 17.6% |
| Currency Symbol | 899164 | 8.7% |
| Space Separator | 899164 | 8.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4946152 | 73.5% |
| 5 | 450225 | 6.7% |
| 1 | 345271 | 5.1% |
| 2 | 266534 | 4.0% |
| 3 | 180629 | 2.7% |
| 4 | 133995 | 2.0% |
| 7 | 120134 | 1.8% |
| 6 | 110952 | 1.6% |
| 8 | 98042 | 1.5% |
| 9 | 77594 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 925342 | 50.7% |
| . | 899164 | 49.3% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 899164 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 899164 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10352362 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4946152 | 47.8% |
| , | 925342 | 8.9% |
| . | 899164 | 8.7% |
| $ | 899164 | 8.7% |
| (space) | 899164 | 8.7% |
| 5 | 450225 | 4.3% |
| 1 | 345271 | 3.3% |
| 2 | 266534 | 2.6% |
| 3 | 180629 | 1.7% |
| 4 | 133995 | 1.3% |
| Other values (4) | 406722 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10352362 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4946152 | 47.8% |
| , | 925342 | 8.9% |
| . | 899164 | 8.7% |
| $ | 899164 | 8.7% |
| (space) | 899164 | 8.7% |
| 5 | 450225 | 4.3% |
| 1 | 345271 | 3.3% |
| 2 | 266534 | 2.6% |
| 3 | 180629 | 1.7% |
| 4 | 133995 | 1.3% |
| Other values (4) | 406722 | 3.9% |
SBA_Appv
Text
| Distinct | 38326 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.308074 |
| Min length | 8 |
Unique
| Unique | 23260 |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | $48,000.00 |
|---|---|
| 2nd row | $32,000.00 |
| 3rd row | $215,250.00 |
| 4th row | $28,000.00 |
| 5th row | $229,000.00 |
| Value | Count | Frequency (%) |
| 25,000.00 | 49579 | 5.5% |
| 12,500.00 | 40147 | 4.5% |
| 5,000.00 | 31135 | 3.5% |
| 50,000.00 | 25047 | 2.8% |
| 10,000.00 | 17009 | 1.9% |
| 17,500.00 | 16141 | 1.8% |
| 15,000.00 | 14490 | 1.6% |
| 7,500.00 | 12781 | 1.4% |
| 127,500.00 | 11946 | 1.3% |
| 80,000.00 | 10965 | 1.2% |
| Other values (38316) | 669924 | 74.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4048030 | 39.8% |
| , | 908994 | 8.9% |
| . | 899164 | 8.8% |
| $ | 899164 | 8.8% |
| (space) | 899164 | 8.8% |
| 5 | 654346 | 6.4% |
| 2 | 433556 | 4.3% |
| 1 | 386969 | 3.8% |
| 7 | 251493 | 2.5% |
| 3 | 186643 | 1.8% |
| Other values (4) | 600290 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6561327 | 64.5% |
| Other Punctuation | 1808158 | 17.8% |
| Currency Symbol | 899164 | 8.8% |
| Space Separator | 899164 | 8.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4048030 | 61.7% |
| 5 | 654346 | 10.0% |
| 2 | 433556 | 6.6% |
| 1 | 386969 | 5.9% |
| 7 | 251493 | 3.8% |
| 3 | 186643 | 2.8% |
| 4 | 180754 | 2.8% |
| 6 | 151450 | 2.3% |
| 8 | 150215 | 2.3% |
| 9 | 117871 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 908994 | 50.3% |
| . | 899164 | 49.7% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 899164 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 899164 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10167813 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4048030 | 39.8% |
| , | 908994 | 8.9% |
| . | 899164 | 8.8% |
| $ | 899164 | 8.8% |
| (space) | 899164 | 8.8% |
| 5 | 654346 | 6.4% |
| 2 | 433556 | 4.3% |
| 1 | 386969 | 3.8% |
| 7 | 251493 | 2.5% |
| 3 | 186643 | 1.8% |
| Other values (4) | 600290 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10167813 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4048030 | 39.8% |
| , | 908994 | 8.9% |
| . | 899164 | 8.8% |
| $ | 899164 | 8.8% |
| (space) | 899164 | 8.8% |
| 5 | 654346 | 6.4% |
| 2 | 433556 | 4.3% |
| 1 | 386969 | 3.8% |
| 7 | 251493 | 2.5% |
| 3 | 186643 | 1.8% |
| Other values (4) | 600290 | 5.9% |
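ChgOffPrinGr, GrAppv and SBA_Appv are typed as Text because every value carries a `$` sign, thousands separators and, judging by the Space Separator count matching the number of observations, one whitespace character. A minimal sketch of turning such strings into numbers follows; the helper name `parse_currency` is hypothetical and the sample strings are modelled on the Sample rows above (the position of the whitespace is an assumption).

```python
import re
from decimal import Decimal

# Characters to discard: dollar sign, thousands commas, any whitespace.
_CURRENCY_NOISE = re.compile(r"[$,\s]")


def parse_currency(text: str) -> Decimal:
    """Convert a string such as '$215,250.00 ' to Decimal('215250.00')."""
    return Decimal(_CURRENCY_NOISE.sub("", text))


# Illustrative inputs shaped like the report's sample values.
samples = ["$48,000.00 ", "$215,250.00 ", "$0.00 "]
amounts = [parse_currency(s) for s in samples]
print(amounts)
```

`Decimal` is used here to keep cents exact; a plain `float` would also work if exactness is not a concern.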
TARGET
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 739609 | 82.3% |
| 1.0 | 159555 | 17.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 739609 | 82.3% |
| 1.0 | 159555 | 17.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1638773 | 60.8% |
| . | 899164 | 33.3% |
| 1 | 159555 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1798328 | 66.7% |
| Other Punctuation | 899164 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1638773 | 91.1% |
| 1 | 159555 | 8.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 899164 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2697492 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1638773 | 60.8% |
| . | 899164 | 33.3% |
| 1 | 159555 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2697492 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1638773 | 60.8% |
| . | 899164 | 33.3% |
| 1 | 159555 | 5.9% |
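The TARGET counts line up with MIS_Status: 739609 rows are `0.0`, matching the `P I F` count, and 159555 rows are `1.0`, which equals the 157558 `CHGOFF` rows plus the 1997 missing ones. The report does not state how TARGET was built, so the mapping below is an inference from those counts, and `derive_target` is a hypothetical name.

```python
from typing import Optional


def derive_target(mis_status: Optional[str]) -> float:
    """Return 0.0 for 'P I F' and 1.0 for anything else, including missing."""
    return 0.0 if mis_status == "P I F" else 1.0


# Category counts copied from the MIS_Status "Common Values" table.
counts = {"P I F": 739609, "CHGOFF": 157558, None: 1997}

positives = sum(n for status, n in counts.items() if derive_target(status) == 1.0)
negatives = sum(n for status, n in counts.items() if derive_target(status) == 0.0)
print(positives, negatives)
```

If this mapping holds, MIS_Status carries the label itself and would have to be excluded from any feature set used to predict TARGET.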
Interactions
Correlations
| | BalanceGross | CreateJob | FranchiseCode | LowDoc | MIS_Status | NAICS | NewExist | NoEmp | RetainedJob | RevLineCr | TARGET | Term | UrbanRural | Zip |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BalanceGross | 1.000 | 0.000 | 0.005 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 | 0.001 |
| CreateJob | 0.000 | 1.000 | -0.054 | 0.003 | 0.012 | 0.157 | 0.009 | 0.034 | 0.377 | 0.011 | 0.012 | 0.082 | 0.025 | 0.026 |
| FranchiseCode | 0.005 | -0.054 | 1.000 | 0.014 | 0.022 | -0.091 | 0.099 | 0.121 | -0.263 | 0.044 | 0.023 | 0.196 | 0.013 | 0.031 |
| LowDoc | 0.000 | 0.003 | 0.014 | 1.000 | 0.088 | 0.060 | 0.116 | 0.000 | 0.003 | 0.087 | 0.088 | 0.068 | 0.157 | 0.059 |
| MIS_Status | 0.000 | 0.012 | 0.022 | 0.088 | 1.000 | 0.148 | 0.022 | 0.004 | 0.013 | 0.146 | 1.000 | 0.492 | 0.211 | 0.081 |
| NAICS | 0.001 | 0.157 | -0.091 | 0.060 | 0.148 | 1.000 | 0.094 | -0.154 | 0.271 | 0.124 | 0.148 | -0.081 | 0.432 | -0.034 |
| NewExist | 0.000 | 0.009 | 0.099 | 0.116 | 0.022 | 0.094 | 1.000 | 0.005 | 0.002 | 0.065 | 0.022 | 0.088 | 0.030 | 0.088 |
| NoEmp | 0.000 | 0.034 | 0.121 | 0.000 | 0.004 | -0.154 | 0.005 | 1.000 | 0.124 | 0.000 | 0.004 | 0.200 | 0.010 | 0.059 |
| RetainedJob | 0.000 | 0.377 | -0.263 | 0.003 | 0.013 | 0.271 | 0.002 | 0.124 | 1.000 | 0.010 | 0.012 | -0.157 | 0.025 | -0.026 |
| RevLineCr | 0.000 | 0.011 | 0.044 | 0.087 | 0.146 | 0.124 | 0.065 | 0.000 | 0.010 | 1.000 | 0.146 | 0.140 | 0.348 | 0.056 |
| TARGET | 0.000 | 0.012 | 0.023 | 0.088 | 1.000 | 0.148 | 0.022 | 0.004 | 0.012 | 0.146 | 1.000 | 0.490 | 0.212 | 0.079 |
| Term | 0.000 | 0.082 | 0.196 | 0.068 | 0.492 | -0.081 | 0.088 | 0.200 | -0.157 | 0.140 | 0.490 | 1.000 | 0.207 | 0.142 |
| UrbanRural | 0.002 | 0.025 | 0.013 | 0.157 | 0.211 | 0.432 | 0.030 | 0.010 | 0.025 | 0.348 | 0.212 | 0.207 | 1.000 | 0.126 |
| Zip | 0.001 | 0.026 | 0.031 | 0.059 | 0.081 | -0.034 | 0.088 | 0.059 | -0.026 | 0.056 | 0.079 | 0.142 | 0.126 | 1.000 |
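The matrix gives 1.000 for the MIS_Status and TARGET pair. This section does not say which association measure is used for categorical pairs; as an illustration only, the sketch below computes plain (uncorrected) Cramér's V, a common choice for two categorical columns, and shows that it reaches 1 when one column fully determines the other. The function name and the toy data are assumptions.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col))
    if k < 2:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * (k - 1)))


# Toy data: the second column is a deterministic recoding of the first.
status = ["P I F"] * 8 + ["CHGOFF"] * 2
target = [0.0] * 8 + [1.0] * 2
print(cramers_v(status, target))
```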
Missing values
Sample
| | State | Zip | NAICS | ApprovalDate | ApprovalFY | Term | NoEmp | NewExist | CreateJob | RetainedJob | FranchiseCode | UrbanRural | RevLineCr | LowDoc | ChgOffDate | DisbursementDate | DisbursementGross | BalanceGross | MIS_Status | ChgOffPrinGr | GrAppv | SBA_Appv | TARGET |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | IN | 47711 | 451120 | 28-Feb-97 | 1997 | 84 | 4 | 2.0 | 0 | 0 | 1 | 0 | N | Y | NaN | 28-Feb-99 | $60,000.00 | $0.00 | P I F | $0.00 | $60,000.00 | $48,000.00 | 0.0 |
| 1 | IN | 46526 | 722410 | 28-Feb-97 | 1997 | 60 | 2 | 2.0 | 0 | 0 | 1 | 0 | N | Y | NaN | 31-May-97 | $40,000.00 | $0.00 | P I F | $0.00 | $40,000.00 | $32,000.00 | 0.0 |
| 2 | IN | 47401 | 621210 | 28-Feb-97 | 1997 | 180 | 7 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 31-Dec-97 | $287,000.00 | $0.00 | P I F | $0.00 | $287,000.00 | $215,250.00 | 0.0 |
| 3 | OK | 74012 | 0 | 28-Feb-97 | 1997 | 60 | 2 | 1.0 | 0 | 0 | 1 | 0 | N | Y | NaN | 30-Jun-97 | $35,000.00 | $0.00 | P I F | $0.00 | $35,000.00 | $28,000.00 | 0.0 |
| 4 | FL | 32801 | 0 | 28-Feb-97 | 1997 | 240 | 14 | 1.0 | 7 | 7 | 1 | 0 | N | N | NaN | 14-May-97 | $229,000.00 | $0.00 | P I F | $0.00 | $229,000.00 | $229,000.00 | 0.0 |
| 5 | CT | 6062 | 332721 | 28-Feb-97 | 1997 | 120 | 19 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 30-Jun-97 | $517,000.00 | $0.00 | P I F | $0.00 | $517,000.00 | $387,750.00 | 0.0 |
| 6 | NJ | 7083 | 0 | 2-Jun-80 | 1980 | 45 | 45 | 2.0 | 0 | 0 | 0 | 0 | N | N | 24-Jun-91 | 22-Jul-80 | $600,000.00 | $0.00 | CHGOFF | $208,959.00 | $600,000.00 | $499,998.00 | 1.0 |
| 7 | FL | 34491 | 811118 | 28-Feb-97 | 1997 | 84 | 1 | 2.0 | 0 | 0 | 1 | 0 | N | Y | NaN | 30-Jun-98 | $45,000.00 | $0.00 | P I F | $0.00 | $45,000.00 | $36,000.00 | 0.0 |
| 8 | FL | 32456 | 721310 | 28-Feb-97 | 1997 | 297 | 2 | 2.0 | 0 | 0 | 1 | 0 | N | N | NaN | 31-Jul-97 | $305,000.00 | $0.00 | P I F | $0.00 | $305,000.00 | $228,750.00 | 0.0 |
| 9 | CT | 6073 | 0 | 28-Feb-97 | 1997 | 84 | 3 | 2.0 | 0 | 0 | 1 | 0 | N | Y | NaN | 30-Apr-97 | $70,000.00 | $0.00 | P I F | $0.00 | $70,000.00 | $56,000.00 | 0.0 |
| | State | Zip | NAICS | ApprovalDate | ApprovalFY | Term | NoEmp | NewExist | CreateJob | RetainedJob | FranchiseCode | UrbanRural | RevLineCr | LowDoc | ChgOffDate | DisbursementDate | DisbursementGross | BalanceGross | MIS_Status | ChgOffPrinGr | GrAppv | SBA_Appv | TARGET |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 899154 | OH | 44405 | 0 | 27-Feb-97 | 1997 | 60 | 1 | 1.0 | 0 | 0 | 1 | 0 | 0 | N | NaN | 30-Sep-97 | $10,000.00 | $0.00 | P I F | $0.00 | $10,000.00 | $5,000.00 | 0.0 |
| 899155 | NY | 11420 | 624410 | 27-Feb-97 | 1997 | 180 | 2 | 1.0 | 0 | 0 | 1 | 0 | 0 | N | NaN | 30-Jun-97 | $123,000.00 | $0.00 | P I F | $0.00 | $128,000.00 | $96,000.00 | 0.0 |
| 899156 | MD | 21224 | 332431 | 27-Feb-97 | 1997 | 60 | 20 | 1.0 | 0 | 0 | 1 | 0 | 0 | N | NaN | 30-Jun-97 | $50,000.00 | $0.00 | P I F | $0.00 | $50,000.00 | $25,000.00 | 0.0 |
| 899157 | CA | 92020 | 314912 | 27-Feb-97 | 1997 | 36 | 40 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 31-Mar-97 | $200,000.00 | $0.00 | P I F | $0.00 | $200,000.00 | $150,000.00 | 0.0 |
| 899158 | TX | 75062 | 0 | 27-Feb-97 | 1997 | 84 | 5 | 2.0 | 0 | 0 | 1 | 0 | N | Y | NaN | 30-Jun-97 | $79,000.00 | $0.00 | P I F | $0.00 | $79,000.00 | $63,200.00 | 0.0 |
| 899159 | OH | 43221 | 451120 | 27-Feb-97 | 1997 | 60 | 6 | 1.0 | 0 | 0 | 1 | 0 | 0 | N | NaN | 30-Sep-97 | $70,000.00 | $0.00 | P I F | $0.00 | $70,000.00 | $56,000.00 | 0.0 |
| 899160 | OH | 43221 | 451130 | 27-Feb-97 | 1997 | 60 | 6 | 1.0 | 0 | 0 | 1 | 0 | Y | N | NaN | 31-Oct-97 | $85,000.00 | $0.00 | P I F | $0.00 | $85,000.00 | $42,500.00 | 0.0 |
| 899161 | CA | 93455 | 332321 | 27-Feb-97 | 1997 | 108 | 26 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 30-Sep-97 | $300,000.00 | $0.00 | P I F | $0.00 | $300,000.00 | $225,000.00 | 0.0 |
| 899162 | HI | 96830 | 0 | 27-Feb-97 | 1997 | 60 | 6 | 1.0 | 0 | 0 | 1 | 0 | N | Y | 8-Mar-00 | 31-Mar-97 | $75,000.00 | $0.00 | CHGOFF | $46,383.00 | $75,000.00 | $60,000.00 | 1.0 |
| 899163 | HI | 96734 | 0 | 27-Feb-97 | 1997 | 48 | 1 | 2.0 | 0 | 0 | 1 | 0 | N | N | NaN | 31-May-97 | $30,000.00 | $0.00 | P I F | $0.00 | $30,000.00 | $24,000.00 | 0.0 |
Duplicate rows
Most frequently occurring
| | State | Zip | NAICS | ApprovalDate | ApprovalFY | Term | NoEmp | NewExist | CreateJob | RetainedJob | FranchiseCode | UrbanRural | RevLineCr | LowDoc | ChgOffDate | DisbursementDate | DisbursementGross | BalanceGross | MIS_Status | ChgOffPrinGr | GrAppv | SBA_Appv | TARGET | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 12 | CA | 93401 | 484210 | 11-Jan-08 | 2008 | 12 | 145 | 1.0 | 124 | 145 | 1 | 2 | Y | N | NaN | 31-Jan-08 | $1,500.00 | $0.00 | NaN | $0.00 | $1,500.00 | $750.00 | 1.0 | 20 |
| 22 | CA | 93401 | 484210 | 30-Jan-09 | 2009 | 12 | 110 | 1.0 | 25 | 110 | 1 | 2 | Y | N | NaN | 31-Mar-09 | $1,500.00 | $0.00 | NaN | $0.00 | $1,000.00 | $500.00 | 1.0 | 8 |
| 17 | CA | 93401 | 484210 | 20-Sep-05 | 2005 | 12 | 82 | 2.0 | 82 | 0 | 0 | 2 | N | N | NaN | 28-Feb-06 | $4,000.00 | $0.00 | P I F | $0.00 | $4,000.00 | $2,000.00 | 0.0 | 6 |
| 103 | PA | 15237 | 233210 | 6-Aug-96 | 1996 | 52 | 12 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 30-Nov-96 | $80,000.00 | $0.00 | P I F | $0.00 | $80,000.00 | $60,000.00 | 0.0 | 6 |
| 10 | CA | 93292 | 233110 | 29-Sep-03 | 2003 | 12 | 3 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 31-Oct-03 | $285,900.00 | $0.00 | P I F | $0.00 | $285,900.00 | $214,425.00 | 0.0 | 4 |
| 13 | CA | 93401 | 484210 | 11-Jan-08 | 2008 | 12 | 145 | 1.0 | 124 | 145 | 1 | 2 | Y | N | NaN | 31-Jan-08 | $3,000.00 | $0.00 | NaN | $0.00 | $3,000.00 | $1,500.00 | 1.0 | 4 |
| 21 | CA | 93401 | 484210 | 25-Jan-07 | 2007 | 12 | 82 | 1.0 | 0 | 82 | 1 | 2 | Y | N | NaN | 28-Feb-07 | $5,000.00 | $0.00 | P I F | $0.00 | $4,000.00 | $2,000.00 | 0.0 | 4 |
| 102 | PA | 15221 | 233210 | 8-Sep-94 | 1994 | 24 | 15 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 30-Apr-95 | $88,125.00 | $0.00 | P I F | $0.00 | $88,125.00 | $74,906.00 | 0.0 | 4 |
| 118 | TN | 37211 | 233320 | 15-Mar-91 | 1991 | 3 | 10 | 1.0 | 0 | 0 | 1 | 0 | N | N | NaN | 31-Jul-91 | $10,000.00 | $0.00 | P I F | $0.00 | $10,000.00 | $8,500.00 | 0.0 | 4 |
| 14 | CA | 93401 | 484210 | 11-Jan-08 | 2008 | 12 | 145 | 1.0 | 124 | 145 | 1 | 2 | Y | N | NaN | 31-Jan-08 | $4,000.00 | $0.00 | P I F | $0.00 | $4,000.00 | $2,000.00 | 0.0 | 3 |
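A table like the one above can be approximated by counting identical rows. The sketch below assumes that "# duplicates" means the number of times a row occurs, which this report does not confirm; `duplicate_groups` is a hypothetical helper and the tuples are a three-column abbreviation of the rows shown.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return {row: occurrences} for every row that appears more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


# Abbreviated rows (State, Zip, DisbursementGross) for illustration only.
rows = [
    ("CA", 93401, "$1,500.00 "),
    ("CA", 93401, "$1,500.00 "),
    ("PA", 15237, "$80,000.00 "),
    ("CA", 93401, "$1,500.00 "),
]
groups = duplicate_groups(rows)
print(groups)
```

Rows must be hashable (tuples here), so a missing value would need a consistent placeholder such as `None` before counting.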